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Abstract 

Background: Monopartite begomoviruses (genus Begomovirus, family Geminiviridae) that infect sweet potato 
{Ipomoea batatas) around the world are known as sweepoviruses. Because sweet potato plants are vegetatively 
propagated, the accumulation of viruses can become a major constraint for root production. Mixed infections of 
sweepovirus species and strains can lead to recombination, which may contribute to the generation of new 
recombinant sweepoviruses. 

Results: This study reports the full genome sequence of 34 sweepoviruses sampled from a sweet potato 
germplasm bank and commercial fields in Brazil. These sequences were compared with others from public 
nucleotide sequence databases to provide a comprehensive overview of the genetic diversity and patterns of 
genetic exchange in sweepoviruses isolated from Brazil, as well as to review the classification and nomenclature of 
sweepoviruses in accordance with the current guidelines proposed by the Geminiviridae Study Group of the 
International Committee on Taxonomy of Viruses (ICTV). Co-infections and extensive recombination events were 
identified in Brazilian sweepoviruses. Analysis of the recombination breakpoints detected within the sweepovirus 
dataset revealed that most recombination events occurred in the intergenic region (IR) and in the middle of the CI 
open reading frame (ORF). 

Conclusions: The genetic diversity of sweepoviruses was considerably greater than previously described in Brazil. 
Moreover, recombination analysis revealed that a genomic exchange is responsible for the emergence of 
sweepovirus species and strains and provided valuable new information for understanding the diversity and 
evolution of sweepoviruses. 
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Background 

Sweet potato {Ipomoea batatas, family Convolvulaceae) is 
one of the most important subsistence crops in developing 
countries and the third most important root crop after 
potato {Solanum tuberosum) and cassava (Manihot escu- 
lenta) [1]. More than 30 viruses are known to infect sweet 
potato and in some cases cause serious diseases in this 
crop [2]. Many of these viruses are monopartite begomo- 
viruses (genus Begomovirus, family Geminiviridae), 

The first two sweet potato begomoviruses to be char- 
acterized at the molecular level were Sweet potato leaf 
curl virus (SPLCV) and Sweet potato leaf curl Georgia 
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virus (SPLCGoV), isolated in Louisiana, USA, in 1999 
[3,4]. Subsequently, begomovirus infections in sweet 
potato have been reported from many countries, includ- 
ing Peru [5], Spain [6], China [7,8], Italy [9], Uganda 
[10], the United States [11] and Brazil [12,13], resulting in 
the description of ten additional novel species [6,7,10-13]. 
Phylogenetically, these viruses, for which the name swee- 
poviruses has been proposed [14], group in a monophyletic 
cluster that is distinct from the two main begomovirus 
branches, the Old and New World groups [6,15]. In 
addition to sweet potato, sweepoviruses can infect other 
hosts such as /. nil or /. setosa [16]. The symptoms caused 
by sweepoviruses depend on the specific host and usually 
consist of leaf curling and vein yellowing, although the in- 
fection can be asymptomatic. 
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Begomoviruses are transmitted to dicotyledonous 
plants by the whitefly Bemisia tabaci and cause import- 
ant yield losses in many crops worldwide [17-19]. They 
have small, circular, single-strand DNA genomes consist- 
ing of one (monopartite) or two (bipartite) components 
encapsidated in twinned icosahedral particles [20,21]. 
The viral DNA- A has one (VI) or two open reading 
frames (ORFs - VI and V2, in Old World begomo- 
viruses) in the virion sense and four ORFs (CI, C2, C3 
and C4) in the complementary sense, separated by an 
intergenic region (IR). The DNA- A encodes the viral 
coat protein (CP or VI) essential for viral transmission 
by B, tabaci and a V2 protein that may potentially be 
involved in virus accumulation, symptom development 
and virus movement [22,23]. The complementary-sense 
strand of DNA-A encodes the replication-associated 
protein (Rep or CI), the transcriptional activator protein 
(TrAP or C2), which controls viral gene expression, the 
replication-enhancer protein (REN or C3), required for 
viral DNA replication, and C4, a suppressor of post- 
transcriptional gene silencing (PTGS). The DNA-B of 
bipartite begomoviruses encodes two proteins, the 
nuclear shuttle protein (NSP - BVl) and the movement 
protein (MP - BCl) involved in intra- and inter-cellular 
movement within the plant [24] . 

Because sweet potato plants are vegetatively propa- 
gated, accumulation of viruses may occur and results in 
the co-infection of multiple viral genomes in a single 
plant. Mixed infections of sweepovirus species and 
strains have been previously shown to be frequent in 
sweet potato [6,11,13]. This phenomenon is extremely 
important for virus evolution because it provides oppor- 
tunities for the occurrence of natural recombination 
events leading to extensive viral diversity [25-27]. The 
importance of recombination to geminivirus evolution is 
well known [26,28,29], and it is probably the mechanism 
responsible for the genetic diversification and emergence 
of the most agriculturally important begomovirus species 
[30-32]. While generating descendants with increased 
fitness, recombination has also been the cause of the 
increased genetic diversity within the begomoviruses 
that consequently complicates the classification of new 
species. 

In this report, we present a study of the genetic diver- 
sity among sweepoviruses in Brazil. Thirty-four new 
complete sequences were determined. Based on these 
new sequences and on other sequences available in pub- 
lic sequence databases, the classification and nomencla- 
ture of sweepoviruses were revised in accordance with 
the current guidelines of the Geminiviridae Study Group 
of the International Committee on Taxonomy of Viruses 
(ICTV). We also provide clear evidence of recombin- 
ation events that may have led to the emergence of new 
sweepovirus strains and species. 



Results 

Sequence analysis of full-length sweepovirus genomes 

The complete nucleotide sequence of 34 cloned isolates 
(GenBank accessions HQ393442 to HQ393472 and 
HQ393474 to HQ393476) corresponding to putative 
full-length sweepovirus genomes was determined from 
the sweet potato Embrapa germplasm bank (SPEGB) 
and from commercial field samples. All genomes (ran- 
ging from 2779 to 2843 nucleotides) had the typical 
organization of monopartite begomoviruses with two 
ORFs in the virion sense (VI and V2) and four ORFs in 
the complementary sense (CI, C2, C3 and C4). All 
sequences contained the conserved nonanucleotide 
sequence 5'-TAATATT|AC-3' and four iterative ele- 
ments (iterons, short repeated sequences important for 
the replication process; Additional File 1), three direct 
(I, II and III) and one inverted (IV), with the core con- 
sensus sequence GGWGR located around the TATA box 
[33]. The iteron-related domain (IRD) in the N- terminal 
region of the replication-associated protein (Rep IRD) 
was also identified [34]. Sequences were identified that 
contain three Rep IRDs (MATPKRFRIS, MAPPNRFKIQ 
and MPRAGRFNLN) that differ from those previously 
described by Lozano et al [6] and Zhang and Ling [11] 
for sweepoviruses (Additional File 1). 

The 34 sequences determined here were compared 
with the sequences of 67 sweepovirus isolates obtained 
from sequence databases (Table 1 and Additional File 2). 
Each isolate was named following the standard nomen- 
clature for begomoviruses (Table 2; see Additional File 2 
for complete isolate names). Based on the current guide- 
lines proposed by the ICTV Geminiviridae Study Group 
[35,36], two isolates belong to the same species if the 
overall nucleotide identity is >89%. The isolates 
described in this study belong to three species. Sweet 
potato leaf curl virus (SPLCV), Sweet potato golden vein 
virus (SPGVV) (the term "associated" was eliminated 
from the previous name, sweet potato golden vein- 
associated virus, following the standard begomovirus 
naming recommendation) and Sweet potato leaf curl 
Spain virus (SPLCESV), with the percentage of nucleo- 
tide identity ranging from 92.2-98.4% within each spe- 
cies. Twenty-one of the isolates belong to three novel 
strains (the demarcation threshold for distinguishing dif- 
ferent strains of a species is 89-94% nucleotide identity 
[36]) of SPLCV and SPGVV named SPLCV-Brazil 
(SPLCV-BR), SPLCV-Pernambuco (SPLCV-PE) and 
SPGVV-Rondonia (SPGVV-RO). The other isolates are 
variants of SPLCV-United States (SPLCV-US), SPGVV- 
Paraiba (SPGVV-PB) and SPLCESV. The isolates of 
SPGVV and SPLCESV were found only in the SPEGB, 
whereas SPLCV isolates were found in both the SPEGB 
and commercial field samples. The samples from Sao 
Paulo state were infected by both SPLCV-US and 



Albuquerque et at. Virology Journal 2012, 9:241 
http://www.vi rologyj.conn/content/9/1/241 



Page 3 of 1 3 



Table 1 Sweepoviruses used in this study 



Species-Strain 


Accession no 


Identification according 
to databases 


Origin 


Reference 


Acronym suggested 
in this study 




HQ393450 


SPLCV-US[BR:PA:08] ^ 


Brazil 


This study 


SPLCV-US[BR:PA:08] 


SPLCV-US 


H0393451 


SPLCV-USrBR'SE'PV'081 


Brazil 


Thl^ ^tuHv 

1 1 II J jiuvi y 


SPLCV-USrBR'SE'PV'081 




H0^9^44^ 


SPLCV-USFBR'BA'CAI '081 


Brazil 


1 1 11 J jiuuy 


SPLCV-USFBR'BA'CAI •081 




HQ393446 


SPLCV-US[BR:BA:CA2:08] 


Brazil 


This study 


SPLCV-US[BR:BA:CA2:08] 




FJ969834 


SPLCV-RS2[BR:Estl] 


Brazil 


Paprotka Pt al., 2010 


SPLCV-US[BR:RS:Estl :07] 




F |QfiQ«^7 
1 J y\jyo.j 1 


SPI rV-RS^TRR-Rnqll 


Brazil 


Panrntka pt ?\ 7010 


SPI r\/-[)SrRR-RS-Rn<;1-071 




FJ96983S 


SPLCV-RS2[BR:Mac1] 


Brazil 


Panrntka pt al ?01 0 


SPLCV-US[BR:RS:Macl :07] 




H0^9^471 


SPLCV-USFBR'SP'AMI '091 


Brazil 


1 1 11 J jiuuy 


SPLCV-USrBR'SP'AMI '091 




H0^9^472 


SPLCV-USrBR*SP*AM2*091 


Brazil 


1 1 11 J jiLiuy 


SPLCV-USrBR*SP*AM2*091 




F |Qf=,983fS 


SPI r\/-RS?rRR-Pna11 


Brazil 


Panrntka pt al 7010 


SPI rV-l ISTRR-RS-Pnal -071 




H0^9^47'5 

1 l\^«.#7<i..#^/ «/ 


SPLCV-USrBR*SP*AM4*091 


Brazil 


1 1 11 J jiuuy 


SPLCV-USrBR*SP*AM4*091 




H0^9^474 


SPLCV-USrBR*SP*AM3*091 


Brazil 


1 1 11 J jiuuy 


SPLCV-USrBR*SP*AM3*091 




1 1 1 V 1 / ^^Vj^ 1 


SPI CV-rHapnamll 

V LI icicri icii i i ij 


Sni ith KnrP3 


VJCI 1 LJCI 1 1 1\ 


SPI CV-USTKR-Hap 1-091 

J>\ l_V_ V LJ-JLIM \.l ICIC 1 \JJ\ 




1 1 1 V I / ^n^\J J / 


SPI C\l-\y^(S\c^c^ 5071 


Sni ith \\C\x(^7\ 


fZpnRank' 

vJCI 1 UO 1 1 1\ 


SPI rV-l ISrKR-Ypn507-091 




FJS60719 


SPI CKrV-rj-5081 


Sni Ith Knrppi 


GpnBank 


SPLrV-USrKR-Ypn508-081 




HM7S4639 

1 1 1 V 1 / ^^VJ J J 


SPI rV-rHapnam5321 


Sni ith Knrppi 

JLJULI 1 Ix^^l CCI 


VJCI 1 DC! 1 1 1\ 


SPI rV-USrKR-Hap532-091 

jri_^v LJ-jLixix.i icic^-JZ- .w 




1 1 1 V 1 / ^^vj JO 


SPI CV-rHapnam 519-31 

i_v„ V LI icicri icii \ \ ^ \ j o\ 


Sni ith Knrppi 


VJCI 1 VjO 1 1 1\ 


SPI rV-USrKR-Hap5 19-091 

J>\ l_V_ V L-'-JLlXl \.l ICIC^ 1 J XJ J \ 




HM 75463 5 


cpi rw-rvpoioo 3881 


Sni Ith Knrppi 


GpnBank 


SPLCV-USrKR-Ypn388-091 




HM754636 


SPLCV-[Nonsan 445-2] 


South Korpa 


GpnBank 


SPLCV-US[KR;Non445-2:09] 




HM754634 


SPLCV-[Chungju 263] 


South Korpa 


GpnBank 


SPLCV-US[KR;Chu263:09] 




HM 754640 


SPLCV-[Hapnam 618-2] 


South Korpa 


GpnBank 


SPLCV-US[KR;Hap61 8:09] 




AF 104036 


SPLCV-USrUS-Lou-941 


USA 


1 ntrakul iR/ Valvprdp 1999 

i_^^Liar\ui 0( vaivcimc:/ 1 J J J 


SPLCV-USrUS-Lou-941 




H0393453 


SPLCV-USrBR'RO'OPO'081 


Brazil 


Thi^ ^tuHv 


SPLCV-USrBR*RO'OPO'081 




F 11 76701 

1 J 1 I \J I \J \ 


SPI CV-rFastprn Chinal 

-jr i_v_ V Li—CioLCi 1 1 v„i III icij 




vjci 1 Da 1 1 1\ 


SPI rv-usrrN-iia-08i 






SPI rV-riaran-Kvntn-l 9981 


jTpal 


VJCI 1 DC! 1 1 1\ 


SPI rV-USriR-Kvn-981 




H0333141 


SPLCV-[US:I\/1S:WS1 -4] 


USA 


Zhang & Ling, 201 1 


S P L CV- U S rU S - M S - WS 1 -4-0 71 




HQ333142 


SPLCV-[US;MS:WS3-8] 


USA 


Zhang & Ling, 201 1 


SPLCV-US[US:MS:WS3-8:07] 




HO333140 


SPI rv-rus-)\/is-4R-i4i 


USA 


Zhann (R/ 1 inn 701 1 

z_i ICII ly oc 1—11 ly^ 1 1 


SPI rV-USrUS-)\/lS-4b- 14-071 




HQ333139 


SPLCV-[US;MS:1B-1A] 


USA 


Zhang & Ling, 201 1 


SPLCV-US[US:MS:lb-la;07] 


SPLCV-CN 


FU767799 


SPLCV-rRL71 

J> \ V [1 \l_ / J 


China 


GpnBank 


SPLCV-CN [CN:Yn:RL7:07] 




FN806776 


SPLCV-rY3381 


China 


GpnBank 


SPLCV-CN rCN-Yn338-091 




EU253456 


SPLCV-[RL31] 


China 


GpnBank 


SPLCV-CN [CN:Yn:RL3 1:07] 


SPLCV-SP 


HQ393473 


SPLCV-SP[BR:AlvM:09] 


Brazil 


Albuguprc|up Pt al., 201 1 


SPLCV-SP[BR:SP:AlvM:09] 




H0393476 


SPLCV-SPrBR'SP'PP'091 


Brazil 


Thl^ ^tuHv 

1 1 II J jiuviy 


SPLCV-SPrBR'SP*PP'091 


SPLCV-JP 


AB433787 


5 P LCV- [Ja pa n : Ku m a m oto: 1 998] 


Japan 


GpnBank 


SPLCV-JP [J R:KurTi:98] 




AB433786 


SPLCV-[Japan:Miyazaki:l 996] 


Japan 


GpnBank 


SPLCV-JP [J R:Miy:96] 


SPLCV-SC 


HQ333138 


SPLCV-[US;SC:646B-1 1] 


USA 


Zhang & Ling, 201 1 


SPLCV-SC[US:SC:646-B1 1:06] 




nU333 13/ 


brLLV-LUb.bL.634- /I 


USA 


Zhang & Ling, zu i i 


brLLV-bLLUb.bL.634-/.U6J 




HQ333135 


SPLCV-[US:SC:377-23] 


USA 


Zhang & Ling, 2011 


SPLCV-SC[US:SC:377-23:06] 




HQ333136 


SPLCV-[US:SC:634-2] 


USA 


Zhang & Ling, 2011 


SPLCV-SC[US:SC:634-2:06] 


SPLCV-BR 


HQ393445 


SPLCV-BR[BR:BA:CA:08] 


Brazil 


This study 


SPLCV-BR[BR:BA:CA:08] 




HQ393460 


SPLCV-BR[BR:RO:Cac:08] 


Brazil 


This study 


SPLCV-BR[BR:RO:Cac:08] 




HQ393455 


SPLCV-BR[BR:RO:OPO:08] 


Brazil 


This study 


SPLCV-BR[BR:RO:OPO:08] 




HQ393449 


SPLCV-BR[BR:SE:Ria:08] 


Brazil 


This study 


SPLCV-BR[BR:SE:Ria:08] 




HQ393442 


SPLCV-BR[BR:BA:Uru:08] 


Brazil 


This study 


SPLCV-BR[BR:BA:Uru:08] 
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Table 1 Sweepoviruses used in this study (Continued) 



c n 1 ^\ / n n 

5PLCV-PR 


DQ644563 


5PLCV-LN41 


Puerto Rico 


GenBank 


SPLCV-PRLPR:Me-N4:06] 




DQ644562 


5PLCV-LPR80-N2] 


Puerto Rico 


GenBank 


SPLCV-PR[PR:80-N2:06] 


SPLCV-PE 


HQ393456 


SPLCV-PE[BR:RO:PV:08] 


Brazil 


This study 


SPLCV-PE[BR:RO:PV:08] 




HQ393464 


SPLCV-PE[BR:RS:IVIP1 :09] 


Brazil 


This study 


SPLCV-PE[BR:RS:IVIP1 :09] 




HQ393465 


SPLCV-PE[BR:RS:lvlP2:09] 


Brazil 


This study 


SPLCV-PE[BR:RS:lvlP2:09] 




HQ393467 


SPLCV-PE[BR:RS:lvlP5:09] 


Brazil 


This study 


SPLCV-PE[BR:RS:lvlP5:09] 




HQ393462 


SPLCV-PE[BR:PE:CSF:08] 


Brazil 


This study 


SPLCV-PE[BR:PE:CSF:08] 




HQ393463 


SPLCV-PE[BR:PE:CSF:08] 


Brazil 


This study 


SPLCV-PE[BR:PE:CSF:08] 




HQ393461 


SPLCV-PE[BR:PB:PF:09] 


Brazil 


This study 


SPLCV-PE[BR:PB:PF:09] 




HQ393466 


SPLCV-PE[BR:RS:IVIP4:09] 


Brazil 


This study 


SPLCV-PE[BR:RS:IVIP4:09] 




HQ393469 


SPLCV-PE[BR:RS:IVIP7:09] 


Brazil 


This study 


SPLCV-PE[BR:RS:MP7:09] 




HQ393468 


SPLCV-PE[BR:RS:IVIP6:09] 


Brazil 


This study 


SPLCV-PE[BR:RS:IVIP6:09] 




HQ393470 


SPLCV-PE[BR:RS:IVIP3:09] 


Brazil 


This study 


SPLCV-PE[BR:RS:IVIP3:09] 


SPLCV-Fu 


FJ515898 


SPLCV-[Fp-3] 


China 


Yang et al., 2009 


SPLCV-Fu[CN:Fj:Fp3:07] 




FJ515897 


SPLCV-[Fp-2] 


China 


Yang et al., 2009 


SPLCV-Fu[CN:Fj:Fp2:07] 




FJ5 15896 


SPLCV-[Fp-l] 


China 


Yang et al., 2009 


SPLCV-Fu[CN:Fj:Fpl :07] 


SPLCV-ES 


EU856364 


SPLCV-ES[ES;CI:BG 1 2:02] 


Spain 


Lozano et al., 2009 


SPLCV-ES[ES:CI:BG1 2:02] 




EF456744 


SPLCV-ES[ES:CI:BG6:02] 


Spain 


Lozano et al., 2009 


SPLCV-ES[ES:CI:BG6:02] 




EU856366 


SPLCV-ES[ES:CI:BG 13:02] 


Spain 


Lozano et al., 2009 


SPLCV-ES[ES:CI:BG1 3:02] 


SPLCV-IT 


AJ586885 


SPLCV-IT[IT:Sic:02] 


Italy 


Briddon et al., 2005 


SPLCV-IT[IT:Sic:02] 


SPLCLaV-BR 


FJ969833 


SPLCV-RSl [BR:Tavl] 


Brazil 


Paprotka et al., 2010 


SPLCLaV-BR[BR:RS:Tavl :07] 


SPLCLaV-ES 


EF456746 


SPLCLaV-[ES:CI:BG27:02] 


Spain 


Lozano et al., 2009 


SPLCLaV-ES[ES:CI:BG27:02] 




EU839579 


SPLCLaV-[ES:Mal:BG30:06] 


Spain 


Lozano et al., 2009 


SPLCLaV-ES[ES:Mal:BG30:06] 


SPLCBeV 


FN432356 


SPLCBV-[lndia:West Bengal:2008] 


India 


GenBank 


SPLCBeV-[IN:Ben:08] 


5PLCBRV 


FJ969832 


5PLCV-CELBR:Forl] 


Brazil 


Paprotka et al., 2010 


SPLCBRV-LBR:CE:Forl :07] 


SPLCCaV 


EF456742 


SPLCCaV-[ES:CI:BG4:02] 


Spain 


Lozano et al., 2009 


SPLCCaV-[ES:CI:BG4:02] 




EF456745 


SPLCCaV-[ES:CI:BG7:02] 


Spain 


Lozano et al., 2009 


SPLCCaV-[ES:CI:BG7:02] 




EU856365 


SPLCCaV-[ES:CI:BG21 :02] 


Spain 


Lozano et al., 2009 


SPLCCaV-[ES:CI:BG21 :02] 




FJ529203 


SPLCCaV-[ES:CI:BG25:02] 


Spain 


Lozano et al., 2009 


SPLCCaV-[ES:CI:BG25:02] 


SPLCShV 


EU309693 


SPLCV 


China 


GenBank 


SPLCShV-[CN:Sha:07] 


SPLCGoV 


AF326775 


SPLCGV-[16] 


USA 


Lotrakul et al., 2003 


SPLCGoV-[US:Geo:16] 


MerLCuV-US 


HQ333143 


SPGVaV-[US:MS:l B-3] 


USA 


Zhang & Ling, 201 1 


MerLCuV-US[US:MS:l B-3:07] 


MerLCuV-BR 


FJ969829 


SPGVaV-PA[BR:Bell] 


Brazil 


Paprotka et al., 2010 


MerLCuV-BR[BR:PA:Bel 1 :07] 


MerLCuV-PR 


DQ644561 


MeLCV-[PR:Nl] 


Puerto Rico 


GenBank 


MeLCuV-PR[PR:Nl:06] 


lYVMaV 


EU839576 


IYW-[ES:Mal:IGl:06] 


Spain 


Lozano et al., 2009 


IYVMaV-[ES:Mal:IGl :06] 


lYW 


EU839578 


IYW-[ES:Mal:IG5:06] 


Spain 


Lozano et al., 2009 


IYW-[ES:Mal:IG5:06] 




AJ 132548 


IYW-[ES:98] 


Spain 


Banks et al., 1999 


IYW-[ES:98] 




EU839577 


i\AA / rnr k/i-^i i/^n r\r~\ 

IYVV-LE5:Mal:IG3:06] 


Spain 


Lozano et al., 2009 


i\A A / rnp K yi 1 i/^n r\r~\ 

IYW-LES:Mal:IG3:06] 


SPLCSPV 


HQ393477 


SPLCSPV-[BR:AlvM:09] 


Brazil 


Albuquerque et al., 201 1 


SPLCSPV-[BR:SP:AlvM:09] 


SPMV 


FJ969831 


SPMaV-[BR:BSBl] 


Brazil 


Paprotka et al., 2010 


SPMV-[BR:BSB1:07] 


SPLCSCV 


HQ333144 


SPLCSCV-[US:SC:648B-9] 


USA 


Zhang & Ling, 2011 


SPLCSCV-[US:SC:648-B9:06] 


SPLCUV 


FR751068 


SPLCUV-[UG:KAMP:08] 


Uganda 


Wasswa et al., 201 1 


SPLCUV-[UG:KAMP:08] 


SPLCESV 


EF456741 


SPLCESV-[ES:CI:BG1:02] 


Spain 


Lozano et al., 2009 


SPLCESV-[ES:CI:BG1:02] 




EF456743 


SPLCESV-[ES:CI:BG5:02] 


Spain 


Lozano et al., 2009 


SPLCESV-[ES:CI:BG5:02] 




HQ393448 


SPLCESV-[BR:BA:Uti:08] 


Brazil 


This study 


SPLCESV-[BR:BA:Uti:08] 




FJ151200 


SPLCESV-[ES:Mal:IG2:06] 


Spain 


Lozano et al., 2009 


SPLCESV-[ES:Mal:IG2:06] 




HQ393458 


SPLCESV-[BR:RO:Cac:08] 


Brazil 


This study 


SPLCESV-[BR:RO:Cac:08] 
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Table 1 Sweepoviruses used in this study (Continued) 



SPGW-PB 


HQ393444 


SPGVV-PB[BR:BA:CA:08] 


Brazil 


This study 


SPGW-PB[BR:BA:CA:08] 






QDr'X/DX/ DR1 TRD-Q^M ill 
jroVdV-rD 1 LDlioOU IJ 


Br3zil 


raproiKa ei ai., zuiu 


oroV V-r DLDn.r d.jOU I.U/J 


SPGW-RO 


HQ393459 


SPGVV-RO[BR:RO:Cac:08] 


Brazil 


This study 


SPGVV-RO[BR:RO:Cac:08] 




HQ393457 


SPGVV-RO[BR:RO:PV2:08] 


Brazil 


This study 


SPGVV-RO[BR:RO:PV2:08] 




HQ393454 


SPGVV-RO[BR:RO:OPO:08] 


Brazil 


This study 


SPGW-RO[BR:RO:OPO:08] 




HQ393447 


SPGVV-RO[BR:BA:Uti:08] 


Brazil 


This study 


SPGW-RO[BR:BA:Uti:08] 




HQ393452 


SPGVV-R0[BR:SE:PV1 :08] 


Brazil 


This study 


SPGVV-R0[BR:SE:PV1 :08] 


SPLCCNV 


DQ5 12731 


SPLCV-[CN] 


China 


Luan et a!., 2007 


SPLCCNV-[CN:05] 




JF736657 


SPLCV-B3 


China 


GenBank 


SPLCCNV-[CN;Zhe:10] 



^The isolates present in this study are shown in bold. 

"^Complete names of the viruses are shown in the Additional file 2. 



SPLCV-Sao Paulo (SPLCV-SP) isolates. However, the 
samples from the States of Pernambuco, Paraiba and Rio 
Grande do Sul were shown to be infected by SPLCV-PE 
(Table 2), Interestingly, we identified co-infection in six 
samples (#171, #184, #325, #337, #346 and #370) from 
the SPEGB, whereas all of the samples from commercial 
fields were apparently infected by a single species/strain, 
as suggested by the uniformity of the clones obtained 
from these samples (Table 2). 



Taxonomic and phylogenetic analysis of sweepoviruses 

Pairwise comparisons using Clustal V were performed 
using the sequences determined here and all full-length 
sweepovirus sequences available in the databases. It is 
worth noting that a few sweepovirus isolates were likely 
misclassified according to the taxonomic criteria for gemi- 
nivirus classification [36] (the proposed new names are 
shown in Table 1 and Additional File 2). These isolates 
were SPLCV-Ceara[Brazil:Fortalezal] (SPLCV-CE[BR: 
Fori], FJ969832), SPLCV-Rio Grande do SuU [Brazil: 
Tavaresl] (SPLCV-RSl[BR:Tavl], FJ969833) [12], Sweet 
potato golden vein-associated virus- [United States: 
Mississipi: lb-3:07] (SPG VaV- [US:MS: lB-3] , HQ333143) 
[11], SPGVaV-Para[Brazil:Beleml] (SPGVaV-PA[BR:Bell], 
FJ969829) [12] and Ipomoea yellow vein virus- [Spain:Mal- 
aga:IG 1:2006] (IYW-[ES:Mal:IGl:06], EU839576) [6]. The 
genome of isolate SPLCV-CE[BR:Forl] shares <89% nu- 
cleotide identity with all other begomovirus sequences 
(Additional File 2), and in accordance with the cut-off 
point of 89% identity established for species separation 
within the genus Begomovirus [35], it most likely belongs 
to a new species, proposed here as Sweet potato leaf curl 
Brazil virus (SPLCBRV). The isolate SPLCV-RSl[BR:Tavl] 
shares >90% identity with SPLCLaV isolates, and we there- 
fore proposed it be classified as Sweet potato leaf curl 
Lanzarote virus-Brazil [BR:RS:Tavl:07] (SPLCLaV-BR[BR: 
RS:Tavl:07]). The IYW-[ES:Mal:IGl:06] sequence shared 
<89% identity with lYW- [Spain: 1998] and all other bego- 
movirus sequences; it is therefore suggested that it be 



classified as a new species named Ipomoea yellow vein 
Malaga virus- [ES:Mal:IG 1:06] (IYVMaV-[ES:Mal:IGl:06]). 
In addition, based on the nucleotide sequence identities 
found (Additional File 2), we propose that the isola- 
tes SPG VaV- [United States:Mississippi:lB-3] (SPGVaV- 
[US:MS:lB-3], HQ333143), SPGVaV-[Brazil:Beleml] 
(SPGVaV-PA[BR:Bell], FJ969829) and Merremia leaf curl 
virus- [Puerto Rico:Nl] (DQ644561) be classified as strains 
of Merremia leaf curl virus (MerLCuV), specifically MerL- 
CuV-US[US:MS:lB-3:07], MerLCuV-BR[BR:PA:Bell:07] 
and MerLCuV-PR[PR:Nl:06], respectively. 

In the UPGMA phylogenetic tree (Figure 1), the swee- 
povirus sequences were consistently grouped in accord- 
ance with the proposed species/strain classification and 
were separated from both the Old and New World bego- 
moviruses as was expected from the pairwise nucleotide 
identity analysis (Additional File 2). 



Recombination analysis 

We searched for evidence of recombination in an align- 
ment of all 101 complete sweepovirus sequences. Differ- 
ent methods were used for recombination breakpoint 
prediction and provided strong evidence for at least 13 
recombination events spread across 19 of 101 analyzed 
genomes (Figure 2). Remarkably, following the adopted 
criteria (detectable by seven different analytical methods 
and recombined fragments with >97% nucleotide iden- 
tity with parental sequences), most recombination events 
were detected among the isolates from the SPEGB. Eight 
recombination patterns were detected for sequences 
reported in this work, while five were found in the previ- 
ously published sweepovirus sequences. The recombin- 
ation breakpoints were detected between the intergenic 
region (IR) and VI (events 1, 2, and 5); VI and C2/C3 
(event 3); CI and CI (event 4), VI and C2 (event 6), VI 
and CI (event 7); V2 and C2/C3 (event 9); C1/C4 and 
V1/V2 (event 12) and CI and IR (events 8, 10, 11, and 
13) (Figure 2). After analysis with the RDP3 program, 
the recombination events detected for the sequences 
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Table 2 Origins of the 34 sweepovirus isolates used in this study 



Sampl6 OriQin 


Sampl6 


Enzyme 


Species 


Strain [Isolate] 


Acronym 


Accession No 


SPEGB^ 


#167 


EcoRV 


SPLCV^ 


UnitpH StatPsTRra/il-Rahia-r ru7 Has Almasl 90081 

1 M LCLJ LU LC-J \_D\ OZ- \ l.DO IMU.^IUZ. LJU-J / \MMC1-Jl .j!L\J\jKjj 


SPLCV-US[BR:BA:CA1 :08] 




SPEGB 


#1 71 


Sad 




IJnitpH Statpc;rRra7il-Rahia-r riiz Ha^; Almac;790081 

1 M LCU ~J La LCo \_D\ OZ- \ l.DO IMCl.v^lUZ- Uuo / \MI ICloZ..Z.WWL)J 


SPI rV-USrRR-RA-rA2-081 


H0393446 


SPEGB 


#293 


EcoRV 




ilnitpH Srarp<;rRra7il-Para-20081 


SPI rV-l ]SrRR-PA-081 


HO3934S0 


SPEGB 


#325 


Sad 




UnitpH StatP'^rRrazil-RnnHnnia-Pnrtn Vplhn90081 

\J I Wl^Kj -J LU LCD [LJ 1 aZ-l 1 .1 \VJ 1 Ivj^^l \\0,\ Kjl IKJ veil IKJ ,Z-\J\jKjj 


SPLCV-US[BR:RO:PV:08] 


H03934S1 

1 1 J _/ J^ J 1 


SPEGB 


#337 


EcoRV 




1 1 1 LCU -J Lu LCo [D 1 CIZ_II .1 \kJ I lU LJI 1 ICI.W UILJ FICLLJ KJkJ WCo LCZ-WWOJ 


SPI rV-USrRR-RO-OPO-081 


H03934S3 

1 IV^J-Zj^JJ 


5go P3Ulo 


#SP12 


BamHI 




United States[Brazil:Sao Paulo:Alfredo Marcondesl;2009] 


SPLCV-US[BR:SP:AM1 :09] 


HQ393471 


Qan Pp\\ iln 


#SP1 2 


Sad 




IJnitpH Statpc;rRra7il-San Paiiln-AlfrpHn MarrnnHp<;7900Ql 


SPI rV-lJSrRR-SP-AM2-0Ql 


H0393477 

1 1 J J^ / Z. 


Qpin Pai iln 


#SP88 


RamHI 

DO 1 1 1 1 M 




ilnitpH Statp<;rRra7iKan Pai iln-AlfrpHn MarrnnHp<;3-200Ql 

uiiiLCU J)iaiCoiu\ a^w.Ou'u rauikj.rAii i cukj iviai L-Ui iucjJ.z.vjvJ^j 


SPI rV-l ]SrRR-SP-AM3-0Ql 


H0393474 


San Pai iln 


#SP130 


BamHI 




IJnitpH StatPsTRra/iKan Paiiln'AlfrpHn MarcnnHps4900Ql 


SPLCV-US[BR:SP:AM4:09] 


H039347S 


Qan Pp\\ iln 


#SP140 


RamHI 

DO MM M 




San PaiilnrRrayihSan PaMln-Prp<;iHpntp PniHpntp-^OOQl 

-jOKj rciui'^LLJiciZ-M.-jci'^ rciuim.ricoiuciiLC riuuciiLC.z.ww_/j 


SPI CV-SPTRR-SP-PP-OQl 

-jr i_^ V -jr [Ul \.-jr .r r 


H0393476 


SPEGB 


#1 34 


Spel 




Rra7iirRra7il-Rahia-[ In in ira-70081 

L-)ICIZ.IILL)ICIZ.II.L)CII IICI.^I LICLIC CI.Z.\JVJOJ 


SPI rV-RRTRR-RA-l jrii-081 


H0393447 


SPEGB 


#1 71 


BamHI 




Brazil[Brazil:Baliia:Cruz das Almas:2008] 


SPLCV-BR[BR:BA:CA:08] 


H039344S 


SPEGB 


#235 


RamHI 

UO MM M 




Rra7iirRra7il-Sprninp-Riarhan90081 

Ul CIZ_IILL)I CIZ_ll.-Jd yi[^CI\ICIL_l \0\J.^\J\J uj 


SPI rV-RRrRR-SF-Ria-081 

l_^ V Dl \LL-)I \. J l_. 1 MCI.WUJ 


H0393449 


SPEGB 


#337 


Sad 




Rra7iirRra7il-RnnHnnia-Oi irn Prptn Hn Opstp90081 

LJIuZ-MLDICIZ-M.lxLJI lULJI 1 ICI. WU \ kJ F I C LLJ KJKJ WCj LCZ-WWUJ 


SPI rV-RRrRR-RO-OPO-081 

i_^ V Di \ LL)i \.i xw.wr w.woj 


H03934S5 

1 lV^w)_/w)^JJ 


SPEGB 


#370 


Sad 




Rra7iirRra7il-RnnHnnia-r arna 190081 


SPI CV-RRrRR-RO-farORl 

l_V_ V L)l xLLJI \.l \W.^CIL_.WL)J 


HO393460 


SPEGB 


#346 


EcoRV 




Pprnamhi irnrRra7il'RnnHnnia'Pnrtn \/plhn9nnRl 
rci 1 loi 1 luuL-wLuiaz-iLrvLji iulji iio.rLji ikj vcii \\J,z^\J\JO] 


SPI rV-PFrRR-RO-PV-081 

-jr V r i_LL)r\.r\w.r v.vjoj 


H03934S6 
1 1 v^-j :7-j^_'U 


Parpiih?! 

1 a 1 a 1 LJO 


#PB82 


Sad 




pprnpirpihi i(--pvrRra7il'Paraiha'PpHras Hp Fnnn90081 


SPLCV-PE[BR:PB:PF:08] 


H0393461 


PprnpirfiKi ifpv 
r CI 1 loll 1 1 kjUL-i^ 


#PE49 


RamHI 

DO MM M 




Pprnpii^Ki i(--(~\rRra7il'Ppnamhi irn'Gamnrin Hp San FpHyI 900^1 

rciiiaiiikjUL.^^LLJiaZ-MXCiiaiiikjUL.'^.v_.ciiiiLjL.Mi uc ~jokj i cmai ,z.\j\jd] 


SPI CV-PFrRR-PF-rSFl -081 

-jr i_^ V r i_LL-)i \.r i_.v„-ji i .woj 


H0393467 

1 iv^w)_/w)^vjz. 


Pprn^m hi irn 
r CI 1 lai 1 1 uuL-kj 


#PE49 


Sad 




Pprnamhi irnrRra7il'Pprnamhi irn'r~amnrin Hp San FpliY99nnRl 
rciiiaiiiuuL.w[_uiaz.ii,rciiioiiikjUL.Lj.v_ciiiiLjL.iii uc -jokj i ci \az.,z^\j\jo] 


SPI rV-PFrRR-PF-rSF?-081 
or V r i_LL)r\.r l_.v-ji z..vjoj 


H0393463 
1 iv_yj:7j^u-j 


Rin f-iranrlp Hn Siil 

rMU' 1 Cl 1 lUC \J\J -JUI 


#RS9 


RamHI 

DO MM M 




Pprnamhi irnrRra7il'Rin nranHp Hn Si il'N/lariana Pimpntpl 1 90001 
rciiiciiiiuuL.LJLLJiciz.ii.rMLj yioiiuc kjkj -jui.iviciiiciiici riiiiciiLCii ,z.\j\jy] 


SPI rV-PFTRR-RS-MPI -OQl 
-jri_v_v r i_Lur\.r\ J.I vir i .vj_7j 


H0393464 


Rio Gr3nd6 do Sul 


#RS9 


Sad 




Pernambuco[Brazil;Rio Grande do SuhMariana Pimentel2;2009] 


SPLCV-PE[BR:RS:MP2:09] 


HQ393465 


Rin Granrip Hn '^iil 

IMl^ 1 Cl 1 lUC VJKJ -JUI 


#RS24 


Sad 




pprnamhi irnrRra7il'Rin GranHp Hn SiiI'Mariana Pimpntpl4-900Q1 

rCIIICllllkJUL.'^LLJIClZ-M.lxH^ VJIClllUC JUI.IVICIIICIIICI rMIICIILCI^.Z.WWyj 


SPI rV-PFrRR-RS-MP4-0Ql 

-jr i_^ V r i_ [Ul \. 1 V 1 r _/j 


H0393466 


Rin Granrip Hn '^iil 

IMl^ 1 Cl 1 lUC -JUI 


#RS29 


RamHI 

DO MM M 




Pprnamhi irnrRra7il'Rin GranHp Hn SiihMariana PimpntplS900Ql 

r \ \0\ I \\UKJ^KJ\_D\ OZ-\\,\\\KJ VJIClllUC '^JKJ JUI.IVICIIICIIICI rMIICIILCI^.Z.WWyj 


SPI CV-PFrRR-RS-MPS-OQl 

-jr i_v_ V r i_llji \. 1 \ J.I V 1 r j.w_/j 


H0393467 


Rin f-iranHp Hn Siil 

rMU' 1 Cl 1 lUC \J\J -JUI 


#RS33 


RamHI 

DO MM M 




Pprnamhi irnrRra7il'Rin f-iranHp Hn Si il'N/lariana Pimpntplf^900Ql 
rciiiciiiiuuL.wLL)iciz.ii,r\iLj vjiciiiuc kjkj -jui.iviciiiciiici riiiicii lci vj.z.vjvj;yj 


SPI rV-PFrRR-RS-MP6-0Ql 


H0393468 


Rin f-iranHp Hn Siil 

rMU' 1 Cl 1 lUC \J\J -JUI 


#RS52 


RamHI 

DO MM M 




pprnamhi irnrRra7il'Rin f-iranHp Hn Si il'N/lariana Pimpntpl7900Ql 
rciiiciiiikjUL.LJLUiciz.ii.rMLj vjiciiiuc kjkj -jui.iviciiiciiici riiiiciiLCi / .z.\j\jy] 


SPI rV-PFrRR-RS-MP7-0Ql 


H0393469 


Rio Gr3nd6 do Sul 


#RS52 


Sad 




Pernambuco[Brazil!Rio Grande do SuhMariana Pimentel3!2009] 


SPLCV-PE[BR:RS:MP3:09] 


HQ393470 


SPEGB 


#171 


BamHI 


SPGVV"^ 


Paraiba[Brazil:Bahia:Cruz das Almas:2008] 


SPGW-PB[BR:BA:CA:08] 


HQ393444 


SPEGB 


#184 


Sad 




Rondonia[Brazil:Bahia:Utinga:2008] 


SPGW-RO[BR:BA:Uti:08] 


HQ393447 


SPEGB 


#325 


Sad 




Rondonia[Brazil:Rondonia:Porto Velhol :2008] 


SPGW-RO[BR;RO:PV1;08] 


HQ393452 


SPEGB 


#337 


EcoRV 




Rondonia[Brazil:Rondonia:Ouro Preto do Oeste:2008] 


SPGW-RO[BR;RO:OPO:08] 


HQ393454 


SPEGB 


#346 


EcoRV 




Rondonia[Brazil:Rondonia:Porto Velho2:2008] 


SPGW-RO[BR;RO:PV2:08] 


HQ393457 


SPEGB 


#370 


BamHI 




Rondonia[Brazil:Rondonia:Cacoal:2008] 


SPGW-RO[BR:RO:Cac:08] 


HQ393459 


SPEGB 


#184 


Spel 


SPLCESV^ 


[Brazil:Bahia:Utinga:2008] 


SPLCESV-[BR:BA:Uti:08] 


HQ393448 


SPEGB 


#370 


BamHI 




[Brazil;Rondonia:Cacoal;2008] 


SPLCESV-[BR:RO;Cac:08] 


HQ393458 



oweei puLdLU criiurdpd uerriipidbiii DdriK.. 
^ Sweet potato leaf curl virus. 

Sweet potato golden vein virus. 
^ Sweet potato leaf curl Spain virus. 

reported in this study (events 1-8, Figure 2A, 2B) were 
tested using the SimPlot program (Figure 3). Every event 
identified by the RDP3 program was confirmed by Sim- 
plot. SPLCV-US[BR:RO:OPO:08] (HQ393453) appeared 
to be a recombinant (recombination points detec 
ted at nucleotide (nt) positions 37 and 1006) of 
the putative parental-like strains SPLCV-SP[BR:SP: 
AlvM:09] (HQ393476) and SPLCV-BR[BR:RO:OPO:08] 
(HQ393455) (Figure 3A). Among the isolates from the 
SPLCV-BR strain, SPLCV-BR[BR:BA:Uru:08] (HQ393442) 



contained three recombination events (event 2, break- 
points at nucleotide positions 58-523; event 3, nt posi- 
tions 955-1325 and event 4, nt positions 1926-2614), 
three putative parental-like strains: SPGVY-RO 
[BR:BA:Uti:08] (HQ393447), SPLCV-US[BR:RO:OPO:08] 
(HQ393453) and an unknown sequence (Figure 3B). For 
the SPLCV-BR isolates [BR:BA:CA:08] and [BR:RO: 
Cac:08], two recombinant events were detected (event 2, nt 
positions 45-541 and event 4, nt positions 1926-2614), 
whereas SPLCV-BR[BR:RO:OPO:08] contained only event 
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■ SPLCV 

■ SPLCLaV 

■ SPLCBeV 

■ SPLCBRV 

■ SPLCCaV 

■ SPLCShV 

■ SPLCGoV 

■ MerLCuV 

■ lYVMaV 

■ lYVV 

■ SPLCSPV 

■ SPMV 

■ SPLCSCV 

■ SPLCUV 
SPLCESV 

■ SPGVV 

■ SPLCCNV 





» SPLCV-US[BR:PA:08] HQ393450 

► SPLCV-US[BR:SE:PV:08] HQ393451 
»SPLCV-US[BR:BA:CA1:08] HQ393443 
» SPLCV-US[BR:BA:CA2:08] HQ393446 

► SPLCV-US[BR:RS:Est1:07] FJ969834 
» SPLGV-US[BR:RS:Ros1:07] FJ969837 

► SPLCV-US[BR:RS:Mac1:07] FJ969835 
» SPLCV-US[BR:SP:AM1:09] HQ393471 

► SPLCV-US[BR:SP:AM2:09] HQ393472 
» SPLCV-US[BR:RS:Poa1:07] FJ969836 
» SPLCV-US[BR:SP:AM4:09] HQ393475 
» SPLCV-US[BR:SP:AM3:09] HQ393474 

► SPLCV-US[KR:Hae1:09] HM754641 

► SPLCV-US[KR:Yeo507;09] HM754637 
» SPLCV-US[KR:Yeo508;08] FJ560719 

► SPLCV-US[KR:Hae532:09] HM754639 

* SPLCV-US[KR:Hae519:09] HM754638 
» SPLCV-US[KR:Yeo388;09] HM754635 

» SPLCV-US[KR:Non445-2:09] HM754636 

* SPLCV-US[KR:Chu263:09] HM754634 
t SPLCV-US[KR:Hae618:09] HM754640 

► SPLCV-US[US:Lou;94] AF1 04036 

» SPLCV-US[BR:RO:OPO:08] HQ393453 

I SPLCV-US[CN:Jia:08] FJ1 76701 

» SPLCV-US[JR:Kyo:98] AB433788 

» SPLCV-US[US:MS:WS1-4:07] HQ333141 

* SPLCV-US[US:MS:WS3-8:07] HQ333142 
» SPLCV-US[US:MS:4b-14:07] HQ333140 

► SPLCV-US[US:MS:1b-1a;07] HQ333139 
» SPLCV-CN[CN:Yn:RL7:07] EU267799 

► SPLCV-CN[CN:Yn338:09] FN806776 

» SPLCV-CN[CN:Yn:RL31:07] EU253456 
» SPLCV-SP[BR:SP:AlvM:09] HQ393473 

► SPLCV-SP[BR:SP:PP:09] HQ393476 
SPLCV-JP[JR:Kum:98] AB433787 
SPLCV-JP[JR:Miy:96] AB433786 

» SPLCV-SC[US:SC:646-B11:06] HQ333138 
I SPLCV-SC[US:SC:634-7:06] HQ333137 
» SPLCV-SC[US:SC:377-23:06] HQ333135 

► SPLCV-SC[US:SC:634-2:06] HQ333136 
SPLCV-BR[BR:BA:CA:08] HQ393445 
SPLCV-BR[BR:RO:Cac:08] HQ393460 
SPLCV-BR[BR:RO:OPO:08] HQ393455 
SPLCV-BR[BR:SE:Ria:08] HQ393449 
SPLCV-BR[BR:BA:Uru:08] HQ393442 

» SPLCV-PR[PR:Me-N4:06] DQ644563 

► SPLCV-PR[PR:80-N2:06] DQ644562 

» SPLCV-PE[BR:RO:PV:08] HQ393456 
» SPLCV-PE[BR:RS:MP1:09] HQ393464 

► SPLCV-PE[BR:RS:MP2:09] HQ393465 

* SPLCV-PE[BR:RS:MP5:09] HQ393467 
» SPLCV-PE[BR:PE:CSF:08] HQ393462 

► SPLCV-PE[BR:PE:CSF:08] HQ393463 
» SPLCV-PE[BR:PB:PF:09] HQ393461 

» SPLCV-PE[BR:RS:MP4:09] HQ393466 
» SPLCV-PE[BR:RS:MP7:09] HQ39346 
» SPLCV-PE[BR:RS:MP6:09] HQ393468 
» SPLCV-PE[BR:RS:MP3:09] HQ393470 
SPLCV-Fu[CN:Fj:Fp3:07] FJ515898 
SPLCV-Fu[CN:Fj:Fp2:07] FJ515897 
SPLCV-Fu[CN:Fj:Fp1:07] FJ515896 
SPLCV-ES[ES:CI:BG12:02] EU856364 
SPLCV-ES[ES:CI:BG6:02] EF456744 
SPLCV-ES[ES:CI:BG13:02] EU856366 

* SPLCV-IT[IT:Sic:02] AJ586885 
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Figure 1 UPGMA phylogenetic tree based on a multiple alignment of the complete sequences of the sweepoviruses described in this 
work (in bold) and those available in public sequence databases. Brandies were bootstrapped witli 1,000 replications. Acronyms are 
described in Table 1. Representative sequences are included for New World (LI 4460, Tomato mottle virus: ToMoV-[US:Flo:89]; U51 137, Abutilon 
mosaic virus: AbMV-[US:Haw]; AFl 73555, Bean golden yellow mosaic virus: BGYMV-[MX:Chi]; and EF417915, Tomato yellow vein streak virus: 
ToYVSV-[BR:Ba3]) and Old World begomoviruses (AFl 12352, African cassava mosaic virus: ACMV-[CM:98]; AY705380, Cotton leaf curl Burewala 
virus: CLCuBuV-[IN:Ban:04]; X74516, AYW-SG[SG:92]; and AF071228, Tomato yellow leaf curl virus: TYLCV-Mdl[ES:72:97]). The scale bar indicates the 
number of substitutions per site. Bootstrap values >50% are indicated. 



2 (Figure 2B), The isolate SPLCESV-[BR:RO:Cac:08] 
(HQ393458) was identified as a recombinant (between 
nt positions 994-2770) of SPLCV-BR[BR:BA:CA:08] 
(HQ393445) and SPLCESV-[BR:BA:Uti:08] (HQ393448) 



(Figure 3C). When SPGW-PB[BR:BA:CA:08] (HQ393444) 
was used as a query sequence, a recombinant breakpoint 
(event 6, nt positions 698-1589) with two putative 
parental-like viruses, SPLCESV-[BR:RO:Cac:08] and 
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Figure 2 Schematic representation of recombination events (A) and details of the recombination breakpoints (B) detected in 
sweepoviruses. The genome organization of a typical sweepovirus is sliown at tine top of tine figure. Eacli genome is represented by an open 
box, colored according to the isolate. Numbers indicate recombination events described in B. R, G, B, M, C, S and T indicate detection by RDP, 
GENCONV, BOOTSCAN, MAXCHI, CHIMAERA, SISCAN and 3SEQ methods, respectively, with the presented highest p-value being that determined 
by the method indicated in bold type. 
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(See figure on previous page.) 

Figure 3 Evidence of recombination events in Brazilian sweepoviruses (A) SPLCV-US[BR:RO:OPO:08], (B) SPLCV-BR[BR:BA:Uru:08], (C) 
SPLCESV-[BR:RO:Cac:08], (D) SPGVV-PB[BR:BA:CA:08], (E) SPGW-RO[BR:RO:Cac:08] and (F) SPLCV-SP[BR:SP:PP:09]. SimPlot analyses were 
performed witli full genome sequence alignments using the Window 200bp, Step 20bp, GapStrip on Kimura (2-parameter) method. 
Recombination points are shown by vertical lines. 



SPLCV-US[BR:PA:08] (HQ393450) (Figure 3D) was iden- 
tified. In contrast, when SPGW-RO[BR:RO:Cac:08] 
(HQ393459) was used as a query sequence, a recombinant 
breakpoint at nt positions 581-1727 (event 7) was identi- 
fied with two putative parental-like viruses, SPLCESV-[BR: 
RO:Cac:08] and an unknown sequence (Figure 3E). The 
remaining four SPGW-RO sequences contained the same 
recombination event observed for SPGW-RO [BR:RO: 
Cac:08] (Figure 2B). Finally, when the analysis was per- 
formed for SPLCV-SP[BR:SP:PP:09] (HQ393476), two dif- 
ferent recombination points (at nt positions 24-2007) and 
two putative parental strains, SPLCV-US[BR:RS:Rosl:07] 
(FJ969837) and SPLCSPV-[BR:SP:AlvM:09] (HQ393477), 
were detected (Figure 3F). 

Discussion 

RCA (rolling circle amplification) has greatly facilitated 
the cloning of geminivirus genomes. This is especially 
true for sweet potato samples, as DNA extraction is dif- 
ficult due to the high polysaccharide content [37]. 
Because RCA-based methods may start from a low 
amount of template DNA, sample dilution is sufficient 
to avoid the harmful effects of contaminating substances. 
In total, 34 complete sweepovirus genomes were isolated 
from sweet potato samples collected from a sweet potato 
germplasm bank (SPEGB) and commercial fields across 
four Brazilian states. Based on ICTV guidelines [36], the 
isolates belong to new strains of SPLCV [38] (strains 
SPLCV-BR and SPLCV-PE), and of SPGVV [12] (strain 
SPGW-RO), and 13 other isolates are considered to 
be variants of SPLCV-US [38], SPLCV-SP [13] and 
SPLCESV [6]. 

A thorough pairwise comparison of all of the sweepo- 
virus sequences available in the databases along with the 
34 sequences reported here was performed, and it 
appeared that four viral isolates were not appropriately 
classified; hence, their classification was reviewed 
according to ICTV guidelines. We found that the iso- 
lates described as SPLCV-CE [12] and IYVV-[ES:IG1] 
[6] would be better classified as novel species, suggested 
here as Sweet potato leaf curl Brazil virus (SPLCBRV) 
and Ipomoea yellow vein Malaga virus (lYVMaV), re- 
spectively. Additionally, the SPLCV-RSl isolate [12] may 
be classified as a new strain of SPLCLaV [6], and the 
name Sweet potato leaf curl Lanzarote virus-Brazil 
(SPLCLaV-BR) is proposed. Thus, it is suggested that all 
SPLCLaV isolates from Spain be classified as SPLCLaV-ES. 
Similarly, the isolates designated as SPGVaV-[US: 



MS:lB-3], SPGVaV-PA[BR:Bell] and Merremia leaf 
curl virus-[PR:Nl] (MeLCV-[PR:Nl]) would be bet- 
ter renamed Merremia leaf curl virus- [US:MS:1B- 
3:07] (MerLCuV-US-[US:MS:lB-3:07]), MerLCuV-BR 
[BR:PA:Bell:07] and MerLCuV-[PR:Nl:06], respect- 
ively. Additionally, we suggest eliminating the term 
"associated" from the virus names, as symptom expression 
was not studied for any of the species. Finally, it was con- 
cluded that 44 of 67 sequences should be designated as 
strains (Table 1). This result clearly illustrates the com- 
plexity of sweepovirus taxonomy and nomenclature. 
Therefore, a set of modifications is suggested for updating 
the sweepovirus nomenclature and facilitating the inter- 
pretation of sweepovirus phylogenetic analysis (Table 1 
and Figure 1). 

The phylogenetic analysis demonstrated that the swee- 
povirus diversity found in the SPEGB samples is higher 
than in samples collected from commercial fields. In the 
SPEGB samples (n=10), three species (SPLCV, SPGVV 
and SPLCESV), five strains (SPLCV-US, SPLCV-BR, 
SPLCV-PE, SPGW-PB and SPGW-RO) and 11 recom- 
binants were found, while in commercial fields (n=ll), 
only one species (SPLCV) and three different strains 
(SPLCV-US, SPLCV-SP and SPLCV-PE) were observed. 
Moreover, co-infections were found solely in the SPEGB 
samples, although they have been shown to be frequent 
in field samples from other countries [6,11]. This could 
be explained by the vegetative propagation of sweet po- 
tato that favors viral accumulation in the roots, the 
maintenance of many sweet potato entries collected 
throughout Brazil and abroad in the same confined 
screenhouse, and the presence of whiteflies (Bemisia 
tabaci) in the germplasm bank facilities. Of the ap- 
proximately 1400 entries in the SBEGB, nine are from 
Argentina, the USA, Paraguay, Japan and Spain, and 
173 were received from the International Potato Center 
(CIP), Peru, as supposedly virus-free in vitro seedlings. 
The seedlings were maintained in vivo for at least three 
years, and this might have resulted in the natural 
spread of the begomoviruses present in some plants, 
thus enabling recombinations to occur. Virus-free 
plants or seeds will be produced to reduce the negative 
effect of virus infection for the breeding program. 

Patterns of inter-species/strain geminivirus recombin- 
ation and a number of hot- and cold-spots have been 
described among members of the genus Begomovirus 
[32,39-41]. We therefore analyzed the recombination 
breakpoints detected within the sweepovirus dataset and 
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a number of recombination events were identified. In 
some cases (Figure 3A, C, D and F), the recombinants 
appeared to be the result of recent recombination events 
because a low mutation rate was observed in this region 
(data not shown). Most of the recombination break- 
points occur in the IR (detected between nt positions 
2770 and 96) and in the middle of the CI ORF (between 
nt positions 2000 and 2250) (Figure 2A). Similarly, three 
recombination breakpoints (events 1, 3 and 5) were 
identified next to the end of the VI ORF (Figure 2A). 
These results are consistent with those obtained from 
geminivirus recombination analyses, which show that 
the Rep, the IR and the interface between ORFs VI and 
C3 are recombination hot-spots [29,40,42,43]. Lefeuvre 
et al [40] also described the presence of a recombin- 
ation cold-spot within the V2 ORF and the third quarter 
of the VI ORF of begomoviruses; here, however, we 
detected the occurrence of recombination breakpoints in 
the first half of the VI ORF. Some sweepovirus recom- 
bination events have been previously described [6,11], 
and most were confirmed in our study (events 9-13, 
Figure 2). The detection of similar recombination pat- 
terns agrees with the recent hypothesis that the recom- 
bination sites are non-randomly distributed along the 
geminivirus genome [29,39,44]. The observed recombin- 
ation patterns are most likely due to the existence of 
regions with higher biochemical and biophysical predis- 
position and with tolerance for recombination. The 
experimental generation of recombinants has shown that 
the IR and VI /C3 interface is a recombination hot-spot 
[43], and this can be explained by the fact that recombi- 
nants derived from recombination breakpoints that 
occur outside the genes are generally more viable than 
those occurring within the genes [29]. In addition to se- 
quence homology, secondary structural features might 
also favor the occurrence of recombination [43]. In our 
case, we detected a number of recombination break- 
points in the Rep and this was also observed among 
other geminiviruses [26,29,45]. 

Conclusions 

Our study shows that the genetic diversity of sweepo- 
viruses both in the SPEGB and commercial crops in 
Brazil is considerably greater than previously reported 
by Paprotka et al, [12] and highlights the importance of 
recombination in the evolution of these viruses. These 
results indicate that recombination events are apparently 
responsible for the emergence of sweepovirus strains 
and species, although alterations in host range, cell trop- 
ism, viral symptoms and pathogenicity remain to be elu- 
cidated. Recently, the generation of the first infectious 
clone of a sweepovirus, SPLCLaV, was described [16], 
which is especially important as it opens the possibility 
of understanding the various aspects of pathogenicity as 



well as its potential use in breeding virus-resistant sweet 
potatoes. Studies on viral diversity in particular regions 
provide important information enabling recommenda- 
tions for viral control strategies and are essential for 
identifying species/strains from which to select isolates 
for screening tests for resistant germplasm. Finally, stud- 
ies on viral diversity are necessary for comprehending 
how sweepovirus diversification results from propagative 
material exchange within the country. The 'in vivo' 
maintenance of vegetatively propagated plants in a 
germplasm bank has proven to be a risky strategy 
because it may enable or accelerate the generation of 
new species/strains that can spread to nature if isolation 
conditions are not sufficient to maintain the bank free of 
insect vectors. 

Methods 

Collection of leaf samples and DNA extraction 

Sweet potato leaf samples showing a variety of symp- 
toms, including vein thickening, chlorosis, curling, mot- 
tling and distortion, were collected from the sweet 
potato germplasm bank of Embrapa Vegetables (Brasilia- 
DF, Brazil) (SPEGB) and commercial fields across four 
Brazilian states (Table 2). Total DNA was extracted fol- 
lowing the protocol described by Doyle and Doyle [46] . 

Cloning strategy 

Circular geminiviral DNA was amplified by rolling-circle 
amplification (RCA) using (|)-29 DNA polymerase 
(TempliPhi kit, GE Healthcare) as described by Inoue- 
Nagata et al [47]. RCA products were digested with a set 
of restriction enzymes (BamHI, EcoRI, EcoRV, Hindlll, 
Kpnl, PstI, Sad, SacII, Spel and Xbal) to identify unique 
sites for cloning the full-length genomes (-2.8 kb). The 
restricted fragments corresponding to putative full-length 
monomer genomes were cloned into the vector pBlue- 
script SK(+) (Stratagene, California, USA) and fully 
sequenced at Macrogen Inc. (Seoul, South Korea). One to 
two enzymes were finally selected for cloning the viral 
genomes present in each sample (Table 2). 

Genetic diversity 

Pairwise nucleotide identity comparisons were calculated 
using Clustal V [48] (included in MegAlign DNASTAR 
Inc., Madison, WI, USA). As recommended by the ICTV 
Geminiviridae Study Group, viruses with nucleotide 
identity between full-genome sequences of <89% were 
considered as distinct species, while those with <94% 
were considered distinct strains of the same species [36]. 

Phylogenetic analysis 

Full genome sequences from 34 virus isolates obtained 
from 21 samples analyzed in this study (Table 2) and the 
67 complete sweepovirus sequences available in public 
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sequence databases (http://www.ncbi.nlm.nih.gov/) as of 
October 2011 (Table 2) were aligned using Muscle [49]. 
The phylogenetic relationships were inferred using 
UPGMA with 1,000 bootstrap replicates, and the evolu- 
tionary distances were calculated using the p-distances 
method implemented in MEGA 5 [50]. 

Recombination analysis 

All sweepovirus sequences used in this study (Table 1) 
were aligned using Muscle with default settings [49], and 
the detection of potential recombinant sequences, the 
identification of likely parental sequences and the 
localization of possible recombination breakpoints were 
performed using RDP [51], GENCONV [26], BOOT- 
SCAN [52], MAXICHI [53], CHIMAERA [54], SiScan 
[55] and 3SEQ [56] methods implemented in the RDP3 
program [57]. Default settings were used throughout. 
Only those potential recombination events detected 
using all of the methods described above and involving 
fragments sharing >97% sequence identity with their 
parental sequences were considered. Putative recombin- 
ation events were analyzed with the SimPlot program 
[58] using the putative recombinant sequence as a 
query. 
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